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5 SITE - SPECIFIC DOUBLE STRANDED DNA ENDONUCLEAS E 

This invention relates generally to a protein, more 
specifically to an endonuclease protein which has not 

10 previously been purified or characterized from its 

natural biological source. This endonuclease is novel and 
extremely useful because it cleaves double-stranded DNA 
at specific, infrequent sites, for which endonucleases 
were not previously available. The resulting fragments 

15 are of great value for human gene mapping because the 
cleavage site is a sequence ordinarily encountered in 
genomic DNA, and because cleavage by the endonuclease 
produces relatively larger fragments than characteristic 
of those produced by many previously available 

20 endonucleases. This invention also includes methods for 
purifying the endonuclease and for cleaving DNA by use of 
the endonuclease. 

1M Restriction Endonucleases? 

25 

One of the essential tools molecular biologists use 
to delve deeper into the mysteries of life contained in 
the structure of DNA, the genetic material, is a 
molecular scissors called a restriction endonuclease. 
3 0 There are many such enzymes which are capable of cutting 
DNA at specific sites (see Lewin, 1987 for review). 

Restriction enzymes (restriction endonucleases) 
recognize specific short sequences of DNA (usually 
35 unmethylated DNA) and cleave the duplex molecule, usually 
at the target recognition site, but sometimes elsewhere. 
In some instances, the recognition site is specific, but 
the cleavage site is located some distance away from the 
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recognition site and does not appear to be at any 
specific sequence. 

"Duplex" refers to the double stranded composition 
5 of the DNA molecule. The cleavage induced by 

endonucleases is usually at specific sequences of 
approximately 4-6 base pairs. A base pair is a union of 
purines or pyrimidines in the DNA duplex. There are four 
such bases and they pair in specific unions: adenine with 
10 thymine, (A-T) , guanine with cytosine (G-C) . 

Fragments generated by endonucleases are amenable 
for further analysis of their nucleotide composition. 
Variation in the fragment sizes obtained from the same 
15 chromosomal locations among individuals , is referred to 
as restriction fragment length polymorphism (RFLP) . 

Restriction endonucleases are essential components 
of methods used to construct maps of the genetic 
material, although not all such endonucleases are useful. 
Some of the problems limiting their use are that cleavage 
may be too frequent using a particular enzyme, producing 
pieces too small to be useful. Another problem is that 
the sites attacked may have nucleotide sequences that are 
25 so unusual that they are not likely to occur in vivo . 
Some enzymes only cleave artificially engineered 
sequences. 

Restriction endonucleases are named by using three 
30 or four letter abbreviations identifying their origin, 
coupled with a letter and/or number designation which 
distinguish multiple enzymes of the same origin. An 
example of the nomenclature is EcoRl, one of the 
endonucleases derived from coll . Most of the 
35 endonucleases discovered initially were isolated from 

bacteria, in which they cleave DNA as part of the natural 
function of the cell. However, other organisms, for 
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example, yeast, can be used as a source of double-strand 
DNA cleaving endonucleases. 

Isolation of many endonucleases occurred because the 
5 bacteria from which the endonucleases were derived were 
able to distinguish between the DNA native to the 
bacteria and any invading foreign DNA. One of the ways 
bacteria recognize foreign DNA is by the absence of 
methyl groups at appropriate base pair sites. The 
10 bacteria protects its own DNA from cleavage by its own 
endonucleases, by methylation of its own DNA bases at 
appropriate target sites. Successful attack on bacteria 
by foreign DNA, for example by viruses, may be due either 
to the fact that the virus DNA has the same pattern as 
15 the host DNA, or alternatively, that mutations have 

caused defects in the ability of the bacteria to produce 
an endonuclease or to attack the foreign DNA. 
Endonucleases isolated from bacteria are of two types, 
one which is only able to cleave DNA, and another in 
20 which both restriction and methylation activities are 
combined. Some restriction endonucleases introduce 
staggered cuts with overhangs while others generate blunt 
ends. 

25 (B) Restriction Mapping 

Gene maps give the location of specific genes 
(specific DNA nucleotide sequences) that encode the 
primary sequences of protein gene products relative to 

30 each other and also localize the genes on specific 

chromosomes of higher organisms. A map of DNA obtained 
by using endonucleases to map breakpoints is called a 
restriction site map and consists of a linear sequence of 
cleavage sites. This physical map is obtained by 

35 extracting DNA from the chromosomes in cells, breaking 
the extracted DNA at various points with endonucleases, 
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and determining the order of cleavage sites by analysis 
of the fragments. 

Distances along the maps are measured directly in 
5 base pairs or, if distances are long, in megabase pairs. 
By comparing the sequences of DNA between relatively 
short distances, a DNA map is constructed in a stepwise 
fashion. A major goal of current research is to 
construct a map of the entire human genome. (The Human 

10 Genome Project r American Society of Human Genetics 

Symposium, Baltimore, Nov. 15, 1989.) Success in mapping 
human and animal genomes will require a selection of 
endonucleases which cleave at a large variety of sites 
which occur in the DNA of living organisms, not just in 

15 artificial sequences. 

DNA fragments produced by the action of 
endonucleases are separated on the basis of size by 
agarose or polyacrylamide gel electrophoresis. An 

20 electric current is passed through the gel, causing the 
fragments to move down it at a rate depending on length; 
the smaller fragments move more rapidly* The result of 
this migration in a gel, is a series of bands each 
corresponding to a fragment of a particular size. Many 

25 different endonucleases are used for gene mapping, and 
large numbers of overlapping fragments are analyzed. 
Sequential cleavage using different endonucleases 
produces a series of larger fragments broken down into 
smaller fragments. A hierarchy is then constructed based 

30 on the fact that there is complete additivity of length 
of the fragments within the original starting fragment. 
For example, a fragment of 2,100 base pairs may be broken 
down into 200 and 1900 base pairs, (see Lewin, 1987 for 
review) ♦ 

35 

Construction of an entire gene map for a species, 
for example construction of the human gene map, is a 
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difficult and tedious task. The larger the number of 
endonucleases available for restriction mapping, the 
easier and more sophisticated the genetic map 
construction. In particular, many endonucleases are 
5 needed which cleave at a variety of specific sites and 
which produce fragments of different lengths. To 
appreciate the magnitude of the mapping problem it should 
be noted that an estimated 3 billion base pairs contained 
in 22 pairs of human chromosomes called autosomes plus 
10 two sex chromosomes s comprise the human genome. 

Restriction maps represent advantages over older 
methods of mapping which identified a series of genetic 
sites because of the occurrence of DNA changes 

15 (mutations), because restriction maps can be obtained for 
any sequence of DNA. Their construction is not dependent 
upon the location of mutations, and no knowledge of the 
function of a particular sequence of DNA is required. 
However, restriction maps are related to, and are 

20 generally colinear with, "genetic" maps. 

Mutations which are deletions or insertions of base 
pairs may be detected in restriction maps by noting an 
alteration of the length of a restriction fragment in 

25 which the mutation lies. Base-pair change type of 

mutations may be detected if their presence inactivates 
or creates a cleavage site of a particular endonuclease, 
altering the length of the restriction fragments produced 
by cleavage in the area of DNA containing the mutation 

30 lies. 

(C\ Restriction Fragment Lencrth 
Polymorphisms fRPLP) 

35 Different alleles (conditions of a gene) may lead to 

the production of different proteins and subsequent 
variation in the phenotype, (the detectable physical, 
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biochemical, or physiologic makeup of the organism) . 
Variation of DNA within populations is called genetic 
polymorphism. Even if the polymorphism does not lead to 
detectable changes in the phenotype by physical 
appearance or biochemical assays, genetic polymorphism 
may be detectable by variations in the DNA restriction 
fragment lengths (RFLP) . Polymorphic variation in the 
restriction map therefore is independent of gene 
function. 

RFLP's have numerous applications including as 
markers for paternity testing or determining the location 
of specific genes. For example, mutant genes responsible 
for inherited diseases such as Huntington's Chorea, a 
progressive neurological degeneration, have been 
localized to specific chromosomes in humans by 
correlating inheritance of RFLP's in families with the 
inheritance of the particular clinical condition. RFLP 
patterns of family members who are normal are compared 
with patterns of family members who are affected with a 
particular genetic disease. 

fP) Otfrer i^ep foy flestrlntl.QTi 
Endonucleases 

Another use of restriction endonucleases is to 
create and use cloning vectors for the transmission of 
DNA sequences. For this purpose, the gene of interest 
needs to be attached to the vector fragment. One way 
this may be accomplished is by generating complementary 
DNA sequences on the vector and on the gene of interest 
so that they can be united (recombined) . Some 
restriction endonucleases make staggered cuts which 
generate short, complementary, single stranded "sticky 
ends" of the DNA. An example of such an action is that 
effected by the EcoRl endonuclease which cleaves each of 
the two strands of duplex DNA at a different point. 
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These cleavage sites lie on either side of a short 
sequence that is part of the site recognized by the 
endonuclease. When two different DNA molecules are 
cleaved with EcoRl the same sticky ends are generated 
5 which enables them to combine with each other. The DNA 
fragment can then be retrieved by cleaving the vector 
with EcoRl to release the gene. 

CD Exons and Introns 

10 

The restriction map of DNA may not correspond 
directly with the coding sequence of messenger RNA 
produced by the DNA because DNA sequences of the total 
gene may consist of exons and introns. Exons are that 

15 part of the DNA code that appear in the messenger RNA. 

Most, but not all, exons code for proteins. Introns are 
DNA sequences that are usually spliced out of the RNA 
product before the messenger RNA proceeds to be 
translated into proteins. Splicing consists of a 

20 deletion of the intron from the primary RNA transcript 

and a joining or fusion of the ends of the remaining RNA 
on either side of the excised intron. Presence or 
absence of introns, the composition of introns, and 
number of introns per gene, may vary among strains of the 

25 same species, and among species having the same basic 
functional gene. Although in most cases, introns are 
assumed to be nonessential and benign, their 
categorization is not absolute. For example, an intron 
of one gene can represent an exon of another. A mosaic 

30 gene is defined as one which is expressed through the 
splicing together of exons carried by one molecule of 
RNA. In some cases, alternate or different patterns of 
splicing can generate different proteins from the same 
single stretch of DNA (Lewin, 1987) . 



35 
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12 ) Mitochondrial DNA: Yeast MitQptipnOy^ 

The DNA contained in mitochondria (cell organelles 
which contain extranuclear DNA) represents an example of 
5 differences arising during evolution between the 

composition of genes with regard to the exon and intron 
sequences as well as non-coding sequences. For example r 
comparing the mitochondrial genes of yeast with those in 
mammalian systems, indicates that identical mitochondrial 
10 proteins are produced despite the disparity in evolution 
between these species. The yeast mitochondrial genomes, 
however, are much larger than those occurring in 
vertebrates due to the absence of introns in the latter 
and the presence of noncoding spacer DNA in the former. 

15 

Primary DNA sequence data are known for many yeast 
isolates (see deZamaroczy and Bernardi, 1986) in which 
interstrain differences are due to (i) a small number of 
large deletions/additions, mainly concerning introns; 

20 (ii) a large number of small (10-150 bp) 

deletions /additions located in the intergenic sequences; 
(iii) 1-3 bp deletions/additions and point mutations. In 
Sfrgch^romyces cerevesiae the size of mtDNA can range up 
to about 84 kilobases, approximately 2/3 of which is 

25 non-coding regions. There are more than 20 mitochondria 
per cell, i.e., approximately 4 genomes per 
mitochondrion. In comparison, vertebrate mitochondrial 
DNA is approximately 17 kilobase pairs. In the 
individual mitochondrion there are usually several copies 

30 of a single molecule of DNA. Moreover, there are very 
many mitochondria per cell. In plants and some 
unicellular eukaryotes, extranuclear DNA is also found in 
chloroplasts. 

35 The DNA within the mitochondria directs protein 

synthesis, just as nuclear DNA does. However, a finite 
number of proteins are produced. There are general 
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similarities in the machinery for gene expression in 
mitochondria of various species, rendering products and 
information derived in one genus applicable to many 
others* For some products, the cytochrome c oxidase in 
5 yeast, for example, protein synthesis combines factors 
produced in the cytoplasm encoded by nuclear DNA with 
those synthesized directly in the mitochondria. 
Mutations identifying almost all the mitochondrial genes 
have been detected. Nuclear mutations that interact 

10 with, or abolish, the effects of these mutational 

complexes in the mitochondria, have also been found. 
Genes coding for many of the same functions are present 
both in the yeast and the mammalian mitochondrial genome, 
making the yeast mitochondria a good model system for 

15 testing theories on gene expression in higher organisms. 
The mitochondrial genome of one of the yeast strains, 
Saccharomvces cerevisiae has provided both information on 
genetic expression, as well as products which can be 
useful for analysis of higher systems. (Lewin, 1987: 

20 Butow 1985) . 

In the invention described herein, a new and unique 
restriction endonuclease has been isolated and purified. 
One obstacle to purification and characterization of this 

25 enzyme in the past has been the inability to accumulate 
sufficient amounts of the protein, a problem which has 
been solved by methods disclosed in this invention. A 
preferred source of the endonuclease described in this 
invention is yeast mitochondria from a special strain. 

30 The endonuclease has wide applications for in vivo or in 
vitro cleavage of double-stranded DNA from many genera. 

In certain aspects, this invention is directed 
towards a substantially purif ied endonuclease preparation 
35 having an apparent molecular weight of 31,000 daltons as 
determined by SDS polyacrylamide gel electrophoresis. 
This endonuclease is capable of cleaving double-stranded 
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DNA at specific sites (Figure 1). Within these sites, 
certain specific nucleotides are essential for cleavage, 
as indicated in Figure 1; at other sites, base 
substitution is compatible with cleavage. 

5 

The endonuclease described in this invention is 
further defined as having a biological activity of up to 
about 100 units/mg of protein in the crude extract. One 
unit of endonuclease activity is defined as that amount 
10 of enzyme that catalyzes the cleavage of 50ng of a DNA 

molecule in one hour at 3d°C, although other definitions 
of activity would be within the scope of this invention. 

Purification of the endonuclease by standard protein 
15 purification techniques was accompanied at each 
sequential step by increased activity of the 
endonuclease. Specific examples comprise a biological 
activity up to about 34,000 units/mg after 
phosphocellulose chromatography; 50,000 units/mg after 
20 Affigel Heparin chromatography; 200,000 units after gel 
filtration; and 500,000 units/mg after DNA affinity 
chromatography (Figures 3-6, Table 1). 

The endonuclease is translated from a fusion between 
25 the upstream exons of the mitochondrial cytochrome 

oxidase subunit I gene (coxl) of yeast (Figure 1) and the 
open reading frame (ORF) within the 4th intron (aI4a) of 
the ( coxl ) gene. The endonuclease was capable of 
cleaving recipient DNA molecules near the site of yeast 
30 mitochondrial coxl aI4a intron insertion (Figures 7-8) . 

The endonuclease is also capable of acting as a maturase 
under certain conditions. Examples of such conditions 
include presence of a point mutation in the intron 
reading frame of the mim-2 mutation, (Dujardin et al., 
35 1982) or the presence of the nuclear NAM2 gene (Herbert 
et al. , 1988) . 
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A preferred source of the endonuclease is 
mitochondria, more specifically yeast mitochondria. An 
object of this invention was to describe a method of 
isolating and purifying the endonuclease from yeast 
mitochondria, more specifically flar.charPWYCe? cerevisjae 
mitochondria. The WA12 strain of yeast containing mtDNA 
derived from strain ID41-6/161/PZ27 was a preferred 



source. 



10 A method is disclosed for preparing the 

endonuclease. The preferred embodiment comprises 
culturing yeast that are incapable of splicing the aI4a 
intron of the coxl gene, thereby accumulating sufficient 
amounts of endonuclease to use for isolation, 

15 purification, and characterization. The method further 
comprises preparing mitochondrial extracts from yeast, 
fractionating the extracts, and selecting a fraction or 
fractions which contain the endonuclease disclosed in 
this invention. The endonuclease is further identified 

20 by its ability to cleave double stranded DNA at the 
specific site: 



C 

AA C CT TA 
GGi — I — |Gi — I GG Ti-i G[— | GG 

1T 6 G T C A2££ A g|aA^gJtAT^ 



Another object of this invention is a method of 
cleaving DNA. This method comprises preparing the 
endonuclease described herein, by the method disclosed in 
this invention, and incubating the isolated and purified 
endonuclease protein with DNA so as to effectuate the 
endonucleolytic cleaving of the DNA. 

Other objects and advantages of the invention will 
become apparent upon reading the following detailed 
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description and upon reference to the drawings in which 
aspects of the invention are illustrated as follows: 

FIGURE l: The PNA cleavage site of the al4<* annn^* 
5 emflonucjease. 

The cleavage site specificity of the endonuclease is 
shown. Boxed residues are those believed to be essential 
for cleavage. The nucleotide changes indicated above the 
10 non-boxed nucleotides, are those that have been tested 

and found to permit cleavage. It is possible that other 
nucleotides will be identified which can substitute at 
these and other positions. The enzyme cut sites are 
indicated by the vertical and horizontal lines. 

15 

FIGURE 2: Intron configurations of the coxl aenA 

The coxl gene of yeast mtDNA contains up to ten 
introns, three of which were found recently in coxl genes 

20 from newly studied isolates (Kotylak et al., 1985; Ralph, 
1986; Ralph et al., unpublished data). The intron 
configuration of this gene in strains D273-10B and 
noyfreiisifr Y-12,656, is shown in lines 1 and 2, 
respectively. The coxl gene structure of a recombinant 

25 resulting from conversion of aI4a is shown in line 3. 

All three strains shown also contain introns all and aI2. 
Because two of the new introns are present in strains 
used in this study, a new nomenclature for coxl gene 
introns was used. Previously described introns 3 and 4 

30 are renamed aI3a and aI4a, respectively. One new intron, 
located between aI3a and aI4a, is called aI3y (because 
another new intron, aI30, is present between aI3a and 
al3y) in qouglasil [Kotylak et al., 1985]). Exon 
sequences are indicated by dark boxes whereas introns are 

35 designated by open or shaded boxes. Arrows indicate 
regions sequenced as a basis for this invention. The 
asterisk denotes the location of two sequence differences 



WO 91/09942 



PCT/US90/07644 



-13- 

between strains D273-10B and S. nor bens is located in exon 
5a. The abbreviations used are: B, BamHI; Be, BCII; H, 
Haelll; E, EcoRl (Wenzlau et al., 1989). 

5 FIGURE 3: Purification of the restriction 

endopucleggg fry phQfiFhQg? A lalfigg chyop^togr^hy 

Fraction la (see Table I) was dialyzed (M* cutoff , 
10,000) against ice cold buffer containing 50 mM 

10 potassium phosphate, pH 7.5, 10% glycerol, 2 mM EOTA, 2 
mM DTT (Buffer A) until an ionic equivalent of 
approximately 100 mM KCl was achieved. The sample was 
loaded onto a phosphocellulose column (5 mg protein per 
packed ml of phosphocellulose) , previously equilibrated 

15 at a flow rate of 20 ml/hour with Buffer A. The column 
was washed with 120 ml of Buffer A containing 200 mM KCl 
at a rate of 60 ml/hour and then a 240 ml linear gradient 
was applied using Buffer A containing 200 mM KCl and 1 M 
KCl at a rate of 8 0ml /hour. The aI4a endonuclease 

20 activity eluted at 0.7 M KCl. Active fractions were 

pooled (fraction II, Table I) and dialyzed against Buffer 
A until an ionic equivalent of 50 mM KCl was reached. 

FIGURE 4: Purification of the restriction 
25 endonuclease by affigel herarin chromatography 

The dialyzed fraction II (Table I) having 
endonuclease activity and resulting from phosphocellulose 
chromatography (Figure 3) was applied to an Affigel 

30 Heparin column, previously equilibrated with Buffer A 

(0.1 mg protein per packed ml of Affigel Heparin), at a 
rate of 2.5 ml/hour. The column was washed with 15 ml of 
buffer A containing 50 mM KCl at 7 ml/hour and eluted 
with a 60 ml linear gradient of Buffer A containing 100 

35 mM to 600 mM KCl at a rate of 15 ml/hour. The enzyme 
eluted at 300 mM KCl, and the active fractions were 
pooled (fraction III, Table I) . 
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FIGURE 5: Purification of the restriction 
endonuclease bv gel f iltration 

Fraction III (Table I, Figure 4) was applied to a 
Sephacryl HS 200 column (1.4 cm x 85 cm) equilibrated 
with Buffer A containing 100 mM KCI at a rate of 3.2 
ml/hour. The column was calibrated with blue dextran 
2000, B-amylase, yeast alcohol dehydrogenase, lactate 
dehydrogenase, boving serum albumin, carbonic anhydrase 
and cytochrome c. The column was eluted with the Buffer 
A and 0.95 ml fractions were collected. Fractions 
containing aI4a endonuclease activity were pooled 
(fraction IV) . 

FIGURE 6: Purification of the r estriction 
endonudease by DNA affinity chr omatography 

Fraction IV (Table I, Figure 5) was applied to a DNA 
affinity column equilibrated with Buffer A containing 100 
mM KCI at 1.5 ml /hour. The column was washed with 6 ml 
of the same buffer at 6 ml /hour and eluted with 18 ml of 
a linear gradient of Buffer A containing 100 mM KCI to 
800 mM KCI at a rate of 9ml/hour. The aI4ct endonuclease 
activity eluted at 300 mM KCI. The active fractions were 
pooled and mixed with an equal volume of 50 uM potassium 
phosphate, pH 7.5, 80% glycerol and 2 mM EDTA and stored 
at -200C (fraction V, Table I) . 

FIGURE 7: In vivo double-stranded cuts in recipient 
DNA molecules 

A restriction fragment spanning the intron insertion 
site was examined for double-stranded breaks in mated 
cells. The strains 5DSS/D273-10B and COP-19/norbensis 
were mated, and mtDNA was isolated at the beginning of 
mating and 3, 6, 9, and 24 hr after mating began. The 
DNA was digested with either Hpa II (A) or Hae III (B) 
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and analyzed to detect double-strand cuts near the sites 
of insertion for omega and al4a, respectively. MtDNA was 
fractionated on 6% polyacrylamide gels, 
electrophoretically transferred to Hybond-M membranes, 
5 and hybridized to end-labeled oligonucleotides specific 
to 2 IS RRNA gene exon sequences (A) or to a sequences 
(B) . A 0.6 kb fragment (arrow) corresponding to an in 
vivo double-stranded cut near the site of omega insertion 
was seen from 6-9 hr after initiation of mating in the 
10 Hpall digest (A). Similarly, a 0.9 kb fragment (arrow) 
consistent with a double-stranded cut near the site of 
al4a insertion was present from 6-9 hr after mating in 
Haelll digests (B) . 

15 FIGURE 8: Mitochondrial extracts contain an 

endonucleas e that cleaves recipient DNA near the site of 
intyon inserts 

Mitochondrial extracts were assayed for their 

20 ability to cleave recipient (R) or donor (D) DNA. 
Plasmids pDRl (recipient) and pJWl (donor) were 
linearized with EcoRl and 3' end-labeled. These 
substrates were incubated with mitochondrial extracts 
from wild-type strain ID41-6/161 and mutant derivatives 

25 PZ27, 6, and K. Cleavage of pDRl DNA at or near the site 
of intron insertion would yield products of 2.1 kb and 
3.0 kb (lanes 3, 5, and 9). Cleavage of pJWl DNA at the 
equivalent exon site would yield products of 1.2 kb and 
2.8 kb. The substrate and extract used are indicated 

30 above each lane: 39 ng of extract protein from strain 

PZ27 was added to lanes 3, 4, and 5; 39 ng and 3.65 ng of 
extract protein from wild-type strain 141-6/161 was added 
to lanes 6 and 7, respectively; and 150 ng of extract 
protein from mutant K and 182 ng of extract protein from 

35 mutant G was added to lanes 8 and 9, respectively. 
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FIGDRE 9: Preliminary cleavage site determine™ nf 
the endonucleaae 

Single-stranded DNA from phages RS18 and RS19 was 
labeled using the Sequenase (United States Biochemical 
Corporation) protocol and the universal sequencing 
primer. This material was then extended either using 
dideoxynucleotides (to generate a sequencing ladder) or 
with unlabeled nucleotides (to generate the substrate) . 
Substrate samples were cleaved with a mitochondrial 
extract from strain PZ27 (see Figure 7) . Figures 9A and 
9B show the sequencing gels generated by this protocol 
using phages RS19 and RS18 as templates, respectively. 
In each panel, lanes 1-4 represent chain termination 
reactions with dideoxy G, A, T, and C, respectively; lane 
5 is the product of a cleavage reaction on the 
"end-labeled" substrate; lane 6 shows this region of the 
gel using uncut substrate to demonstrate no premature 
primer extension products in the vicinity of the cleavage 
site. Figure 9C shows the sequence surrounding the 
cleavage site. Intron sequences are shown in lower case 
letters while exon sequences are in upper case letters. 
The large arrow indicates the point of intron insertion. 
The sites of cleavage on each strand are shown by small 
arrows. The exon sequences of donor and recipient are 
nearly identical; two differences in the exon between 
donor and recipient are shown by asterisks. The bracket 
sets an upper limit on the boundaries of the recognition 
site (based on the data of Figure 10) . 

FIGURE 10: location o f the recognition sequence Q f 
the al4a-encoded endonucleaae 

The recognition sequence was delimited by 
determining the extent of each sequencing ladder of 
Figure 9 that was resistant to cleavage by the 
endonuclease activity. The products of sequencing 
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reactions using phage RS19 as a template are shown in 
Figure IOA, lanes 1-4 (terminated with dideoxy G, A, T, 
and C, respectively) . These sequencing ladders were used 
as a substrate for cleavage reactions using 500 ng of 
5 protein extract from strain PZ27 in lanes 5-8. The 
ladder was resistent to cleavage until a recognition 
sequence was generated. All chains with further 
additions were efficiently cleaved. Figure 10B shows the 
results of a similar experiment using RS18 as a template. 
10 Lanes 1-4 show the products of the sequencing ladder 

(terminated with dideoxy G, A, T, arid C, respectively). 
In lanes 5-8, the products of the ladder were cleaved 
with extract from strain PZ27. The maximum boundaries of 
the recognition sequence are shown in Figure 9C. 

15 

This invention is directed to a new and unique 
endonuc lease, and to methods of preparing and using this 
enzyme. 

20 Although other endonucleases have been discovered 

which cleave DNA at specific sites, there is clearly a 
need for more endonucleases, in particular those with 
special and useful properties, to facilitate gene 
mapping. The endonuclease described herein has the 

25 capability of producing relatively larger fragments of 
DNA and also to act on restriction sites (cleavage 
sites) , that are commonly encountered in the genome. 
Some enzymes operate only on unusual sequences, e.g., 
only GC base pairs. 

30 

The general objective of this invention is to purify 
and apply a new endonuclease protein with useful 
characteristics not previously available to the art. 
Prior to the invention described herein it had not been 
35 possible to isolate and purify the endonuclease protein 

from its natural source so that its sequence and function 
could be analyzed. 



WO 91/09942 PCT/US90/07644 

-18- 

The endonuclease which is disclosed by the current 
invention will cleave ordinary DNA sequences, use of 
this endonuclease for gene mapping will facilitate and 
e^and the resolution of determination of DNA sequences 
5 within fragments cleaved by the endonuclease. 

in one of the preferred embodiments, the 
endonuclease as extracted from mitochondria of strain 

10 unit? 27 ^ a bi0l ° giCal aCtiVlty ° f UP * *» 
10 units/mg per protein. These mitochondrial extracts 

further processed by phosphocellulose chromatography 

resulted in the purity increasing to 34,000 units/mg. 

Application of Aff igeie heparin chromatography increased 

the purity of the endonuclease up to about 50,000 

unxts/mg. Fractionation by gel filtration yielded a 

biologxcal activity of 200,000 units/mg. Further 

purification by DNA affinity chromatography yields a 

biological activity of 500,000 units/mg (Figures 3-6). 

20 ,o\r, Pr0tSin ^ MOle - 1 - weight of 

20 20 kilodaltons as determined by SDS-PAGE. 

A maximum of 14 base pair sites are believed to be 

ZT1T I" CleaVage * ^ ^ at least "ve residues 
known to be essential for the cleavage; in other 

win s'tm SOmS SUbStltUti ° ns ™ P-itted and cleavage 
will still occur (Figure l) . 

Most of the endonucleases discovered to date have 

30 ™» f" toCterlal a " d « e jessed in t 

*° sali becaue. it is .ore difficult to purify endonucieases 
from more complex system. This lisitation wes also due 
to the fact that bacteria are incapable of producing 
quantities of a specific protein in large enough amounts 

35 L° esployed^ 181 ° n W6iCh PUrl£ieitim ~~ 
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The mitochondrial genomes of many yeast species have 
been mapped and shown to contain both introns and exons 
(deZamaroczy and Bernardi, 1986). Two of the most 
informative loci for genetic investigations are the 
5 mosaic genes cob (which codes for cytochrome b) and 6xi3 
(which codes for subunit I of cytochrome £ oxidase. 
(Figure 2). Many of the introns in these two genes have 
open reading frames, sequences which are read as 
extensions of the exon immediately preceding it. An 
10 unusual circumstance is that most introns in 

mitochondrial yeast genomes may be expressed as proteins. 

In common with yeast, the mammalian systems have 
genes for cytochrome b, cytochrome oxidase and subunit s 

15 of ATPase. The known mitochondrial gene products vary 
little among eukaryotes as diverse as fungi and 
vertebrates. The yeast mitochondrial genome consists of 
large numbers of conserved DNA sequences plus 
interruptions by an assortment of so called optional 

20 sequences. The genome of a given yeast strain appears to 
be very stable. Coding sequences among strains are 
conserved, but optional sequences are present or absent 
at various positions in many combinations. The wide 
range of functional combinations of introns arose during 

25 speciation of yeast. Further combinations of those 
probably arose in laboratory strains by unselected 
mitochondrial recombination during the many decades of 
genetic manipulation in which current laboratory strains 
were derived (Butow, 1985) . The operational definition 

30 of optional sequences is that the presence or absence of 
these sequences do not affect phenotypic expression of 
the exon coding region. 

Much has been learned about the mechanism of RNA 
35 splicing by examination of yeast systems. RNA splicing 
normally removes the introns from the transcripts of 
genes by breaking the phosphodiester bonds at the 
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exon- intron boundaries, and subsequently forming bonds 
between the ends of the free exons. There are various 
mutants of yeast which are unable to splice the introns,* 
therefore the precursors accumulate in the cells. This 
5 can occur in either the nucleus or the mitochondrial 

genes or both. The mitochondrial introns in yeast may be 
divided into two types, group 1 and group 2. Group 1 
introns appear to have a common secondary structure, 
short conserved sequences internally, but no specific 

10 conservation of sequences at the splice junctions. Some 
of these introns in the coxl gene and the single intron 
of the large mitochondrial ribosomal RNA gene of 
Saccharomvces cerevisiae r are self -splicing. Mutations 
in other genes, however, can affect the occurrence of 

15 splicing. In both group 1 and group 2 there are some 

introns which must translate an extensive coding sequence 
to splice the intron containing it. Some introns encode 
a protein involved in splicing, called RNA maturases. 
Maturases appear to result from translation of both exons 

20 and introns. Both homologous and non-reciprocal 

mitochondrial recombinations occur in genetic crosses of 
yeast. Non-reciprocal exchanges involving some optional 
sequences have been reviewed by Perlman et al. (1989). 
The best characterized system of non-reciprocal exchange 

25 analyzed in yeast is the omega system. Omega is a 1.1 kb 
optional intron in the 2 is RRNA gene. In genetic crosses 
between omeg^ positive and omega negative yeast, the 
progeny are usually omega positive. That is, there seems 
to be unidirectional transfer of the intron. Mutants 

30 lacking this polarity of transfer occur either as base 
changes close to the site where the intron would be 
inserted, or as mutations in the intron reading frame 
preventing its translation. The protein product of the 
omega intron is not involved in omega splicing. 

35 

The hypothesis generated from studying the omega 
system is that the protein coded by the intron in the 
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omega positive strain ( fit /or omega transposase/ISccI) 
recognizes the DNA site where the intron should be 
inserted in an omega negative strain. This causes it to 
be preferentially inherited. The protein coded by the 
5 omeaa intron has endonuclease activity recognizing a 

specific sequence in the omega negative gene as a target 
for cleavage of the doublestranded DNA (Jacquier and 
Dujon, 1985; Colleaux et al., 1986). The recognition 
site lies at the site where the intron is inserted. The 
10 double-strand break probably initiates what is called a 
gene conversion, a process in which the sequence of the 
omega positive gene is copied and replaces the sequence 
of the omeoa -negative gene. 

15 Cleavage of intronless f omega -negative ) DNA near the 

site of intron insertion is required for conversion, and 
it is believed that introns carrying sequences which 
accomplish gene conversion may have once been independent 
elements that coded their own splicing or interacted with 

20 the DNA of the recipient strain. This creates a 

transposable element (transposon) • Finding that such 
gene conversions are required for cleavage of the 
recipient DNA by a site specific endonuclease, leads to 
the expectation that isolation and purification of such 

25 endonucleases would provide extremely valuable not only 
in understanding the genetic mechanisms of these 
conversions but also could provide a valuable reagent for 
gene mapping studies. 

30 When two parental strains having distinguishable 

mitochondrial DNAs are mated, the progeny are expected to 
represent the parental DNA in some aspects. 
Recombination refers to the new union of DNA present in 
separate parents, in a single progeny DNA. Mitochondrial 

35 DNA from different parental strains is capable of 
recombining. 
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Through studies of gene conversion in yeast 
mitochondria, it has been determined that one of the 
mechanisms achieving gene conversion employs an 
endonuclease aI4a which cleaves the recipient DNA at 
s specific sites (Wenzlau et al., 1989). For one of these 
endonucleases, the aI4o-encoded protein, there exist, 
however, yeast strains that overproduce or overexpress 
the enzyme relative to wildtype yeast strains because of 
various mutations of the yeast mitochondrial genome. An 

10 example of such a strain of yeast is the WA12 strain 
containing the PZ27 mutant mitochondrial DMA of 
Saccharomyces cerevisiae. strain PZ27 was isolated in 
1978. in the mitochondrial genome of this strain the 
cytochrome oxidase gene f coxi l is present, but the 

15 cytochrome b gene is deleted. This configuration blocks 
splicing of ai4o causing the protein encoded by that 
intron reading frame to be present at high levels in 
mitochondria. This protein is a site-specific 
endonuclease that is translated from a fusion between the 

20 upstream exons of coxi and the open reading frame within 
the intron aI4a. All yeast have the coxl gene, and most 
contain intron aI4a. A few strains, however, have lost 
the capability of splicing the intron, leading to 
overproduction of an endonuclease. 

25 

In wild-type yeast strains splicing of this intron 
4a is assisted by a product of the fourth intron of the 
cytochrome b gene called the bI4 maturase. Splicing 
would normally remove the intron. However, there are 

30 several strains in which there is an inability to splice 
aI4ct resulting in an accumulation of the endonuclease 
produced by that region of DNA. This overproduction 
occurs, for example if there are mutants at the bI4 locus 
or deletions of the bI4 reading frame. In strain WA12 

35 containing mtDNA from strain PZ27 there is a deletion of 
the cob gene, consequently a deletion of intron bI4, 
leading to continued production of the endonuclease 



WO 91/09942 

-23- 

produced by the coxl gene because there is no splicing of 
the intron. Otherwise the strain has a wild-type coxl 
gene based on complementation studies, 

5 Some other strains exist with defects in the bI4 

maturase that would lead to this overproduction. In fact 
any strain lacking bI4 maturase and thereby unable to 
splice out aI4a will lead to overproduction of the 
endonuc lease. A preferred embodiment is to use the WA12 

10 strain as a source of crude extracts of mitochondria. 

After culturing the yeast, mitochondrial extracts can be 
prepared by standard methods of lysing cells and 
fractionation. These extracts can be further purified by 
being subjected to phosphocellulose chromatography, 

15 affigel heparin chromatography, gel filtration and DNA 
affinity chromatography. Any other suitable methods of 
purification of proteins known to those skilled in the 
art may be applied. 

20 The endonuclease described herein was capable of 

cleaving double-stranded DNA at a specific site to 
achieve gene conversion in recipient strains. (Figures 
7-8). It also has the potential to become a maturase 
under certain conditions. A maturase is an 

25 intron-encoded RNA splicing protein. It has been 

reported that the maturase coded in yeast by the fourth 
intron of the cytochrome b gene (bI4) is essential for 
splicing the fourth intron of the gene encoding subunit 1 
of the cytochrome c oxidase ( coxl gene) . The coxl intron 

30 4a encodes a protein that is closely related structurally 
to the bI4 maturase, but in wild-type yeast strains is 
not utilized in splicing either the fourth intron of coxl 
or safe. However, the aI4a product can be activated to a 
new maturase form by either a point mutation (mim-2) 

35 (Dujardin et al., 1982) or the presence of particular 
mutant forms of the nuclear NAM2 gene (Herbert et al., 
1988). 
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Although intron-encoded endonucleases have not 
previously been isolated and purified from yeast 
mitochondria, the use of special laboratory strains in 
this invention which continue to produce these products 
5 due to lack of splicing, permitted the isolation of 
endonucleases in amounts sufficient to be isolated, 
purified and characterized. The function and structure 
of one of these endonucleases was determined for the 
first time as an aspect of this invention because of the 

10 increased amounts obtained from the special strain 

WA12/PZ27 of Saccharomvces cerevisiae . The specific site 
of cleavage of double-stranded DNA was determined by 
presenting known sequences of DNA of different lengths to 
the endonuclease and studying the resulting cleavage 

15 products. A partial recognition site determination is 
shown in Figure 10. 

The purified fraction is then selected which 
comprises an endonuclease capable of cleaving DNA at 

20 specific sites* This capability can be tested by various 
assays. The cleavage site, plus flanking DNA sequences 
can be introduced into various plasmids to use as an in 
vitro assay. Essentially any plasmid DNA containing the 
cleavage site will be suitable. In a preferred 

25 embodiment the assay used comprised a plasmid pRSX which 
contains a 292 base pair, Rsal-Hindlll fragment that 
includes the cleavage site from Saccharomvces capensis 
cloned into the vector PBS(+) from stratagene. 

30 Two assays were used to determine the activity of 

the endonuclease: 

1. A supercoiled DNA preparation of pRSX was incubated 
with the endonuclease, and cleavage activity was 
35 determined by gel electrophoresis of the subsequent 

fragments, if cleavage has occurred there will be 
conversion of the supercoiled DNA to the linear form 
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of the plasmid which can be determined by gel 
electrophoresis ; 

PRSX DNA was linearized with Sjal. using standard 
techniques the linear plasmid was end-labelled, 
extracted, and endonuclease activity was determined 
by the production of two radioactive fragments after 
incubation with the enzyme for which activity is to 
be determined. 



An object of this invention is to cleave DMA at 
specific sites. Determining whether this was achieved by 
preparing the endonuclease protein as defined above, was 
done by incubating the enzyme with DNA so as to 
15 effectuate the cleaving of DNA. 

Strains shown in Figure 2 were used in crosses to 
study the transmission and recombination behavior of 
three group 1 introns of the S2£L gene. The data shown 
20 in Figure 7 indicated that the mated cells contain an 
endonuclease activity capable of cleaving recipient 
genomes in the vicinity of the site of conversion. 
Experiments employing petite mutants of yeast which do 
not express mitochondrial genes, showed that no 
25 conversion occured in petites. The conclusion from these 
studies was that a product of mitochondrial protein 
synthesis is needed for al4a conversion. Results of 
crossing strains of yeast differing in their intron 
configuration of three mosaic genes f coxl . cab, 2 IS 
30 rRNA) , and use of various strains having point mutations 
of aI4or, led to the conclusion that the al4a intron 
likely encodes the information needed for aI4a 
conversion. The aI4a reading frame must be intact to 
effect conversion. 



In this invention, purification and subsequent 
application of an enzyme not previously purified, was 
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achieved and found to cleave DNA as shown in Figure 1. 
The endonuclease requires Mg* + at a concentration of about 
25mM for activity and shows high activity at pH 7.5 in 
the presence of 200mM salts of monovalent cations. 
5 Activity does not appear to be stimulated by the addition 
of nucleoside triphosphates (see also Figure 8) . Other 
properties of the endonuclease illustrated in Figure 8 
are: lane 3: the mitochondrial extract from strain P227 
cleaves the recipient plasmid once near the intron 

10 insertion site (this linear product can be circularized 
by DNA ligase); lane 4: donor DNA is not cleaved; lane 5: 
only recipient DNA is cleaved; lanes 6 & 7 show that 
cleavage activity is not detected in extracts of 
mitochondria from a wild-type strain capable of efficient 

15 splicing of aI4a; lane 8: no cleavage using extract from 
an aI4a mutant with a truncated intron ORF; lane 9: 
cleavage from an extract of an aI4a mutant with an intact 
intron ORF. 

20 The endonuclease has been shown to be active in E. 

coli by use of an artificially engineered sequence of the 
endonuclease coding sequences in a transforming vector, 
using the universal genetic code. However, it is highly 
toxic in E, coli probably due to its cleavage of double 

25 stranded E. coli DNA (Delahodde et al., 1989). 

A major advantage of the purified endonuclease, is 
that it cuts DNA relatively infrequently, and 
consequently is particularly useful for genome mapping. 

30 If all 14 nucleotides are required for cleavage, then one 
can estimate 1 to 2 cleavage sites per human chromosome. 
However, because there is some relaxation in the 
specificity of the cleavage site, empirical tests leading 
to actual determination of cuts effected by the 

35 endonuclease in clonal portions of human chromosome 

number 3, show one site about every 200 kilobase pairs. 
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Since the DNA sequence which the inventors have 
discovered to encode for the endonuclease is available 
(Bonitz et al. 1980), it could be engineered for 
expression in, e.g., E. coli (Delahodde et al. 1989). 
However, the preferred embodiment is to prepare this 
endonuclease from yeast mitochondrial extracts because of 
possible variations in the protein during the course of 
molecular engineering and expression in other species and 
because of its apparent toxicity to bacterial cells. 

A major current scientific project is to map the 
entire human genome. This is an expensive, 
labor-intensive, and difficult task which will be 
facilitated by use of the endonuclease described in this 
invention. Human gene mapping permits detection of 
carriers of abnormal genes, and isolation and cloning of 
genes which cause abnormal human conditions. A goal of 
the mapping project is further elucidation of gene 
action, paving the way for therapy. Complete mapping of 
the human genome will also contribute to the field of 
preventive medicine by making it possible to determine 
the susceptibility of individuals to specific genetic 
disorders. Because the phenotype (expression of such 
genes) represents the interaction between genes and 
environment, preventive action based an knowledge of the 
genetic complement of the individual may be achieved by 
altering their environment (diet, drug, medication) to 
reduce or eliminate the severity of the abnormal genetic 
defects. The ultimate goal, of course, is to identify 
these genes so that direct gene therapy will be possible. 
If the genomic sequences are known, then normal genes may 
be directed into the genotype, perhaps at early stages of 
development, to counteract the effects of abnormal genes. 
A further goal of gene mapping is to understand the 
interaction and interrelationships of genes on different 
parts of the chromosome and on different chromosomes. 
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The purification, isolation and use of the 
endonucleasd which is an object of this invention are 
further described in the following examples: 

Example ^ ; X east Strain,, Purification of the 
endonuclease is made possible by the use of a strain or 
strains that overproduce the enzyme. In wild-type yeast 
strains, splicing of aI4a is assisted by the "maturase" 
product of the fourth intron (bl4) of the cytochrome b 
gene (cofe) ; mutants of the bl4 maturase, or deletions of 
all or portions of the bI4 reading frame, result in a 
block in splicing of aI4a. Because the mRNA for aI4c 
endonuclease is a fusion of the intron reading frame with 
those of the upstream exons, the inability to splice aI4a 
results in an accumulation of the endonuclease mRNA and, 
hence, an overproduction of the protein. Therefore, to 
facilitate purification of the endonuclease protein, a 
strain (WA12) , containing the mutant mitochondrial 
genome, P227, which is deleted for the SBh gene, is used. 
(Note, however, that any strain with a defect in the 
synthesis of the b!4-encoded maturase would overproduce 
the aI4a endonuclease and therefore be suitable as a 
starting strain for purification.) 

Example 2; PNA Cleavage sitft T The cleavage site 
for the endonuclease is the sequence shown in Figure 1. 
This site is cleaved with high efficiency by crude and 
purified preparations of the endonuclease which catalyzes 
a 4 bp staggered double strand break as shown in Figure 
1, leaving 3* OH overhangs. Preliminary determination of 
the cleavage site is shown in Figure 9. The boundaries 
of the recognition site are shown in Figure 10. Methods 
are detailed in Wenzlau et al. , 1989. 

Example 3; Assays for endomml o g Se ae jH v 4f Yl For 
an in vjt£o. assay of endonuclease activity, the cleavage 
site shown in Figure 1, plus flanking DNA sequences was 
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introduced into various plasmids. Essentially any 
plasmid DNA containing the cut site is suitable. For 
the assays described herein, the plasmid pRSX containing 
a 292 bp Rsa I - Hind III fragment that includes the cut 
site from caoensis cloned into the vector pBS (+) from 
Stratagene was used. 

Two assays were conveniently used to follow enzyme 
activity: 



a) Cleavage of supercoiled DNA . A supercoiled 
preparation of pRSX was incubated with the endonuclease 
under the assay conditions described below, and cleavage 
activity was evident, following gel electrophoresis of 

15 the sample, as a conversion of the supercoiled to the 
linear form of the plasmid; 

b) Linearization and end-labelling of pRSX, pRSX 
DNA was linearized with Seal. After phenol extraction 

20 and ethanol precipitation, the linear plasmid was 
end-labelled with 32 P dATP and T4 DNA polymerase by 
standard procedures. The labelled DNA was then extracted 
by phenol and the unincorporated dNTPs were removed by 
gel filtration. The radioactive pRSX DNA was ethanol 

25 precipitated and resuspended in sterile water. 

Endonuclease activity was followed by the production of 
two radioactive fragments following incubation with 
active enzyme. 

30 The endonuclease assay conditions used were as 

follows: Reaction mixtures (0.025 ml) contained 25 mM 
Tris-HCI, pH 7.5, 20 mM MgCl 2 , 100 mM KC1, 2 mM DTT and 
100 ng pRSX plasmid DNA. Incubation was carried out at 
30°C for 30 minutes. Reactions were stopped by being 

35 placed on ice and a 1/10 volume gel loading buffer was 
added. The samples were analyzed by electrophoresis in 
0.7% agarose gels containing 1 ng/ml ethidium bromide. 
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For purposes of this invention, one unit of endonuclease 
activity was defined as that amount that catalyzes the 
cleavage of 50 ng of the DNA substrate in one hour at 
30°C. Other definitions used in the art would be 
consistent with this invention. 



10 



Example 4; Protocol for the Pu rification of_fcfag 
Endonuclease. The following table, Table I, is an 
overview of the protocol which was employed to purify the 
enzyme, and also indicates the degree of purification 
that was achieved at each step. 



15 



20 



25 



30 



Table Is SUMMARY OF PURIFICATION 
OF TUB RESTRICTION ENDONUCLEASS 



Sequential 

Purification 

Steps 



Activity* yield 

U X 10" 3 



Protein 

(mg) 



Specific 
Activity 

(U/mg) 



I. 


Crude Extract 


26 


100 


216 


100 


Ia. 


40-60% (NB\) 2 S0 4 


3 




114 




II. 


Phosphocellulose 


20 


79 


0.6 


34,000 


III. 


Aff igel Heparin 


5 


19 


0.1 


50,000 


IV. 


Gel Filtration 


4 


16 


0.02 


200,000 


V. 


DNA Affinity 


1 


4 


0.002 


500,000 



*For purposes of this invention, one unit of 
endonuclease activity was defined as that amount 
that catalyzed the cleavage of 50 nanograms of 
DNA substrate in one hour at 30 °C. Other 
definitions of activity are within the scope of 
this invention. 



35 



( a ) Preparation of Cr ude Mitochondrial Extract and 
Ammonium Sulfate Fractionation 



A Saccharomyces cerevisiae strain (WA12/PZ27) was 
grown in medium composed of 1% yeast extract, 1% 
bacto-peptone and 2% galactose to late log phase at 30 °C. 
The cells were collected by centrifugation, washed with 2 
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mM EDTA, at pH 8, and resuspended at 2 ml/g, wet weight, 
in buffer containing 33 mM potassium phosphate, pH 5.8, 1 
M sorbitol and 2mM EDTA. Mureinase was added at a ratio 
of 4 mg/g, wet weight, and the mixture was incubated on a 
5 platform shaker at room temperature for one hour. 

Spheroplasts were collected by centriguation at 1000 x g 
for 10 minutes at 3°C and resuspended in ice cold 
breakage buffer which contained 50mM Tris-HCl, pH 7.5, 
0.6 M sorbitol, 2 mM EDTA, 2mM DTT and protease 

10 inhibitors (which were also included in all subsequent 
steps of the purification): PMSF (1 mM), leupeptin (1 
^g/ml) and aprotinin (1 /ig/ml) . Spheroplasts were lysed 
by shaking with glass beads. The lysate was recovered 
and the glass beads were washed 3 times with breakage 

15 buffer. The combined spheroplast lysate was centrifuged 
at 1000 x g for 10 minutes at 3°C. 

A high salt extract of the crude supernatant was 
prepared by addition of 0.33 volume of 4 M KC1 to achieve 

20 a final concentration of 1 M KC1. MgCl 2 and DTT were, 
added to final concentrations of 10 mM and 2 mM, 
respectively. The high salt extract was stirred in the 
cold for 30 minutes and centrifuged at 15,000 x g for 15 
minutes at 3°C. The supernatant (fraction I, Table I) 

25 was recovered and diluted 2 -fold by addition of ice cold 
buffer containing 50 mM Tris-HCl, pH 7.5, 2 mM EDTA and 2 
mM DTT. One-tenth volume of a 25% streptomycin sulfate 
solution was added and the extract was stirred in the 
cold for 30 minutes followed by centrifugation at 15,000 

30 x g for 15 minutes at 3°C. 

Solid ammonium sulfate was added to the crude 
supernatant over a period of 30 minutes to achieve 40% of 
saturation at 0°C. Stirring was continued for another 30 
35 minutes to achieve 60% of saturation at 0°C. Stirring 

was continued for another 30 minutes and the precipitate 
was collected by centrifugation and discarded. The 
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supernatant was recovered and additional solid ammonium 
sulfate was added over a period of 30 minutes and the 
precipitate, which contained the aI4o endonuclease 
activity, was collected by centrifugation at 15,000 x g 
for 15 minutes at 3°C and resuspended in buffer 
containing 10 mH potassium phosphate, pH 7.5, 45% 
glycerol and 2 mM EDTA and stored at -20°C (fraction la 
in Table I) . 

The next step in the purification was to subject 
fraction la (Table I) to phosphocellulose chromatography. 
The details of the procedure are described in Figure 3. 
Fractions showing endonuclease activity were pooled 
(fraction II in Table I) and dialyzed against Buffer A 
until an ionic equivalent of 50mM KC1 was reached. This 
dialyzed fraction was then applied to an Affigel Heparin 
column (Figure 4) . The active fractions from this 
procedure were pooled to form Fraction III (Table I) . 
This pooled fraction was then applied to a Sephacryl 
HS200 column (Figure 5) . The active fractions obtained 
from gel filtration were pooled to form Fraction IV 
(Table I) . This fraction was applied to a DNA affinity 
column (Figure 6) . The pooled active fractions (V, Table 
I) were mixed with an equal volume of 5omM potassium 
phosphate, pH7.5, 80% glycerol and 3mM EDTA and stored at 
-20°C. 

Example 5; Analysis of mtpwA. Mitochondrial DNA was 
prepared by the procedure of Bingham and Nagley (1983) . 
Restriction enzyme digestions were performed as 
recommended by the enzyme supplier (Bethesda Research 
Laboratories) . Fragments were separated on 1% agarose or 
6% polyacrylamide gels and transferred by the well-known 
method of Southern (1975) or electrophoretically to 
Hybond-N membranes (Amersham) . Hybond-N membranes were 
prehybridized in 6x SSC containing lox Denhardt's, 0.5% 
SDS, and 0.5 mg/ml of calf thymus DNA at 65 'C. 



WO 91/09942 



PCT/US90/07644 



-33- 

Oligonucleotides were end-labeled with T4 polynucleotide 
kinase and allowed to hybridize to membranes for 2 hr at 
42 *C. Blots were washed three times for 15 min each at 
50 *C in 6x SSC. Other probes, generated by the 
5 multiprime DNA labeling system (Amersham) , were 

hybridized to membranes overnight at 42 # C. Blots were 
washed twice at 42 *C for 15 min each in 2x SSC, 
containing 0.1% SDS, and twice for 15 min each in 0.2x 
SSC, containing 0.1% SDS. All filters were exposed to 
10 Kodak XAR-5 X-ray film. 

Example 6: Extract Preparation and Endonuclease 
Assay . Galactose-grown cells were harvested at late 
logarithmic phase, and a mitochondrial fraction was 

15 isolated as described by Hudspeth et al. (1980) . All 
further manipulations were done at 4°C. Mitochondria 
were resuspended in a solution containing 0.6 M sorbitol, 
25 mM Tris-HCl (pH 7.5), 25 mM EDTA, and 30% Percoll and 
were further purified by banding in self -forming Percoll 

20 gradients obtained by centrifuging the resuspended 

mitochondria at 15,000 x g for 20 min. The mitochondrial 
band was isolated and diluted with 10 vol of a solution 
containing 0.5 M NH«C1 and 25 mM Tris-HCl (pH 7.5) and 
centrifuged at 15,000 x g to pellet the mitochondria. 

25 Mitochondria were lysed in a solution containing 50 mM 

Tris (pH 7.5), 100 mM NH*C1, 10 MM MgCl 2 , 2 mM DTT, and 1% 
NP40. The extract was clarified at 15,000 x g, and the 
supernatant was used for endonuclease assays. Protein 
concentration was determined by the BCA protein assay 

30 (Pierce Chemical Company). 

For the cleavage experiment described in Figure 8, 
the substrates pDRl and pJWl were linearized with EcoRl 
and the overhangs filled in with [or- 32 P]dATP and dTTP 
35 using the Klenow fragment, unincorporated nucleotides 
were eliminated on a spin column and the DNA was phenol 
extracted, ethanol precipitated, and resuspended in a 
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solution containing 10 mM Tris (pH 7.5) and 1 mM EDTA. 
Reactions were performed with 10 ng of substrate in a 25 
Ml reaction. Reaction conditions for this study were 25 
mM Tris(pH 7.5), 25 mM MgCl 2 , 200 mM NH 4 C1, and 2 mM DTT 
5 at 30 °C, Reactions were allowed to proceed for 5 min and 
were stopped with 2.5 Ml of a solution containing 50 mM 
Tris (pH 7.5), 100 mM EDTA, 2% SDS, 15% Ficoll, and 1% 
bromophenol blue. The reaction products were separated 
on a 1% agarose gel and visualized by autoradiography. 

10 

Example 7: Mapping of thg Cleavage and Recognition 
Sites, Using a modification of the Seguenase® (United 
States Biochemical Corporation) sequencing strategy, the 
cleavage site was mapped by generating double-stranded 

15 substrates from single-stranded templates (phages RS18 
and RS19) bearing the cleavage site. Two nanograms of 
the universal sequencing primer was annealed to 1 fig of 
each template in a 20 /*1 Sequenase® labeling reaction. 
The reaction was extended with 1 mM dNTPs to generate the 

20 "end-labeled" substrate. One-fifth of that material was 
cleaved with PZ27 extract as described above. An aliquot 
(1/50) was separated alongside a sequencing ladder 
generated from the rest of the labeling mixture and 
separated on 5% polyacrylamide gels. To determine, the 

25 boundaries of the recognition site, cleavage reactions 
were performed on the sequencing ladders themselves 
(Figures 9 and 10) . 



While the invention is susceptible to various 
modifications and alternative forms, specific embodiments 
thereof have been shown by way of example in the drawings 
and have been described in detail. It should be 
35 understood, however, that it is not intended to limit the 
invention to the particular forms disclosed, but on the 
contrary, the intention is to cover all modifications, 
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equivalents, and alternatives falling within the spirit 
and scope of the invention as defined by the appended 
claims. 
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CfrAiMg; 

1. An endonuclease protein having an apparent molecular 
weight of about 29,000 daltons as determined by SDS-PAGE, 
5 and which is capable of cleaving double-stranded DNA at a 
specific site identified as follows: 



C 



10 




15 2. The endonuclease protein of claim 1, further defined 
as a mitochondrial-derived enzyme. 

3. The endonuclease protein of claim 2, further defined 
20 as a mitochondrial enzyme derived from yeast. 

4. The endonuclease protein of claim 3, further defined 
as a Saccharomyces cer evis iae-dar i ved enzyme. 

25 

5. The endonuclease of claim 1 further defined as having 
a biological activity of up to about 100 units/mg protein 
in crude extracts. 

30 

6. The endonuclease of claim 5 further defined as having 
a biological activity of up to about 34,000 units/mg 
after phosphocellulose chromatography. 

35 
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7. The endonuclease of claim 6 further defined as having 
a biological activity of up to about 50,000 units/mg 
after affigel heparin chromatography. 

5 

8 . The endonuclease of claim 7 further defined as having 
a biological activity of up to about 200,000 units/mg 
after gel filtration. 

10 

9. The endonuclease of claim 8 further defined as having 
a biological activity of up to about 500,000 units/mg 
after DNA affinity chromatography. 

15 

10. The endonuclease of claim 1 wherein the endonuclease 
is translated from a fusion between the upstream exons of 
the mitochondrial cytochrome oxidase subunit I gene 
( coxl ) of yeast and the open reading frame (ORF) within 

20 the 4th intron (aI4a) of the coxl, gene. 

11. The endonuclease of claim I wherein the enzyme is 
capable of cleaving recipient DNA molecules near the site 

25 of yeast mitochondrial coxl intron (aI4a) insertion. 

12. The endonuclease of claim 1 which is also capable of 
acting as a maturase under certain conditions, said 

30 conditions comprising the coincidence of a point mutation 
in the intron reading frame (the mim-2 mutation), or the 
presence of the nuclear NAM2 gene. 

35 13. A method for preparing an endonuclease having the 
capability of cleaving double stranded DNA, comprising 
the steps: 
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culturing yeast that are incapable of splicing the 
aI4a intron of the coxi gene; 

preparing a mitochondrial extract from the yeast; 
fractionating the extract; and 

selecting a fraction or fractions which comprises £ 
endonuclease as defined by claim 1. 



5 



14. The method in claim 13 wherein the yeast comprises 
the WA12/PZ27 strain of Saccharose cerevis^o 

15. The method in claim 13 wherein the yeast comprises a 
strain incapable of expressing the bI4 maturase activity 
of the cytochrome b gene (sab) in an appropriate culture 
media. 



. A method of cleaving DNA comprising the steps of: 
preparing an enzyme as defined by claim 1; and 



incubating the enzyme with DNA so as to effectuate 
the endonucleolytic cleaving of the DNA. 



WO 91/09942 PCT/US90/07644 



i//0 



c 

AA C 
GG I — | G 
TT 6 GT 



0 



CT TA 
GG T i—i Gi— 1 GG 
ATC£| AjGj Aa|gJTAT 



FIGURE 1 



WO 91/09942 



PCT/US90/07644 



tllO 



D273-10B \- 

QlZa aI4a aI5y 



Norbensis 



1 kb 




B Be H Be H 



EBc 



Recombinant 

aI3a aI4a aI5/3 aI5y 




T i i 
B Be Be 



EBc 



FIGURE 2 



WO 91/09942 



PCT/US90/07644 



Endonuclease Activity, u/ul 
■♦— . Protein, mg/ml 
•n — [KCIJ.M 




20 40 60 

Fraction number 



FIGURE 3 



WO 91/09942 PCT/US90/07644 




Fraction number 



FIGURE 4 




PCT/US90/07644 



0 10 20 30 40 50 

Fraction number 



FIGURE 5 



• 

WO 91/09942 



PCT/US90/07644 




FIGURE 6 



WO 91/09942 PCT/US90/07644 



© 



2IS rRNA gene 

100 bp 



° OOUBLE 
x STRANDED 
JL CUT 



: — # . 

I 1 O.Skb 



CD 

0 .2 
*- w 

1 c 
fO o» 

oj o r 



HOURS 



Q 2 0 3 6 9 24 





- 1.9 



- 0.8 
-4-0.6 



L 



J 0.6 kb 



QI3; 



Cox I gene 



DOUBLE 
STRANDED 
CUT 



I00 bp 

« 

o - 



— * 



j 1.5 lb 



CO 

O .52 

P I HOURS 

S I ' 0 3 6 9 24 ' 




— 1.5 



•*- 0.9 



J .9 kb 



FIGURE 7 



WO 91/09942 



# 

PCT/US90/07644 



SUBSTRATE 
EXTRACT 



a: 


| p 


o: 


Q 


+ 
a 


or 






or 










N. 












'1 


cv 


C\J 


CM 




*- 










M 


rsj 


Nl 














CL 


CL 


a. 











-5.1 
-4.0 

- 3.0 

- 2.1 



1 2 3 4 5 6 7 8 



FIGURE 8 



WO 91/09942 



PCT/US90/07644 



V/o 




FIGURE 9 



WO 91/09942 



PCT/US90/07644 



® © 

r— CLEAVED— 1 p CLEAVED -i 

GAT C.'g A T C 1 G A T C 'g A T C 1 



/ 



G 
A 
A 
G 

— A 



T 7*r» 

A .'V^; 



1 2 3 4 5 6 7 8. 1 2 3 4 5 6 7 8 



A 

/\ 

— G 



FIGURE 10 



# 



WO 99/20795 PCT/GB98/03081 

25 

Example 4 

Quantification of Normal KIT and Splice M utant KIT flntron 17 ntl 

As the splice site mutation is present in only one of the duplicated regions of I 
and not in the duplicated region off, the various genotypes can be expected to 
have the attributes described in Table 3. 
TABLE 3 



Genotype Copies of Normal Copies of KIT Ratio of normal KIT 

KIT containing the splice to splice mutant KIT 

mutation 



I/I 2 2 1:1 

I/i 2 1 2:1 

i/i 2 0 2:0 

I/I p 3 1 3:1 

fa 3 0 3:0 



10 

Due to the dominance of allele J, three of the genotypes in Table 2 are carried 
by white animals and therefore can not be identified by phenotypic 
characterisation. Quantification of the relative amounts of the normal KIT gene 
and the splice mutant KIT gene allows the ratio between the two to be 
15 calculated, and therefore the genotype of individual animals predicted. This was 
achieved by quantification of two DNA fragments following Malll digestion. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/20795 



26 



PCT/GB98/03081 



The amount of 134 bp fragment, representative of the normally spliced KIT 
gene, and of 54 bp fragment, representative of the splice mutant KIT, were 
measured following electrophoresis using GeneScan software. 

i. PCR to Produce DNA for Quantification 

As described in example 2 section i. The reverse primer KIT35 is labelled with 
the ABI fluorescent dye FAM at the 5' end 

ii Restriction Enzyme Digestion 
As described in example 2 section ii. 

iii Electrophoresis and Quantification of DNA Fragments 

Following digestion, 0.5 \i\ of the reaction volume was mixed with 2.5 pi of 
deionised formamide, 0.5 (il of GS350 DNA standard (ABI) and 0.4 nl blue 
dextran solution before being heated to 90°C for 2 minutes and rapidly cooled 
on ice. Three |il of this mix was then loaded onto a 377 ABI Prism sequencer 
and the DNA fragments separated on a 6% polyacrylamide gel in 1 X TBE 
buffer for 2 hours at 700 V, 40 mA, 32 W. The peak area of fragments 
representative to both the normal and splice mutant forms of KIT were 
quantitated using the GeneScan (ABI) software. 

iv. Ratio Calculations 

The peak area value of the 134 bp fragment (normal KIT) was divided by twice 
the peak area value of the 54 bp fragment (splice mutant KIT) in order to 
calculate the ratio value for each sample. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/20795 PCT/GB98/03081 

27 

v. Results 

Analysis was performed on animals from the Swedish wild pig/Large White 
intercross pedigree for which genotypes at / have been determined by 
conventional breeding experiments with linked markers. Figure 5 and Table 4 
5 show the ratio of normal to mutant KIT calculated for animals from each of the 
three genotype classes, I/I (expected ratio 1:1), I/i (expected ratio 2:1) and I/f 
(expected ratio 3:1). The results are entirely consistent with the expected ratio 
values and indicate that the three genotype classes can be distinguished using 
this method. 

10 

TABLE 4 

Ratio of the Two KIT Forms in Different Dominant White Genotypes in a 
Wild Pig/Large White Intercross 

15 

Expected Observed Ratio Number 
Ratio (Normal:Mutan Tested 
(Normal: t) 
Mutant) 

±SE 

I/I White 1:1 1.15 ±0.075 13 

Iff White 3:1 3.11 ±0.084 12 

I/i White 2:1 2.23 ±0.109 14 



Figure 5 illustrates that the range of ratio values calculated for the two 
genotypes /// and I/f do not overlap. This enables animals carrying the / allele 
to be identified and the frequency of the allele within different pig breeds 
20 determined. Ratio values were calculated for 56 Landrace and 33 Large White 
animals and the results are shown in Figure 6. A clearly bimodal distribution is 



Genotype Phenotype 
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observed with 7 Landrace and 3 Large White individuals having a ratio value of 
approximately 3 or above, suggesting them to be heterozygous carriers for the 
/ allele (genotype I/f). This means / has gene frequency estimates of 6.25% 
(7/1 12 chromosomes tested) and 4.5% (3/66 chromosomes tested) within the 
5 Landrace and Large White breeds respectively. 

Example 5 

Analysis for presence and qu antification of the porcine KIT splice mutation 
10 using the PE ABT TaoMan chemistry 

Method 

i. Preparation of template DNA for PCR 



15 



DNA was prepared as in example 3, section i 



ii. TaqMan® PCR reactions 



20 



TaqMan 



PCR reactions were set up as shown in table 5 



TABLES 



P CR mix for TaqMan® based splice mutation test 



Final Cone 11 Volume 



Reagent 



lOx TaqMan® Buffer A (Perkin Elmer) 
25mMMgCl 2 Sol n 
DATP 
DCTP 
DGTP 
DUTP 

Amplitaq Gold™ (5U/ul) (Perkin Elmer) 
AmpErase TM N-Glycosylase (lU/ul) (Perkin 
Elmer) 

KITTM -NEST-F (5uM) 
KITTM-NEST-R(5uM) 
KITTM FAM (5uM) 
KITTM TET(5uM) 



lx 2.50 ul 

5mM 5.00 ul 

200uM 0.50 ul 

200uM 0.50 ul 

200uM 0.50 |il 

200uM 0.50 ul 

0.05U/ul 0.25 ul 

O.OlU/ul 0.25 ul 



500nM 
500nM 
lOOnM 
lOOnM 



2.50 ul 
2.50 ul 
0.50 ul 
0.50 \i\ 
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25% Glycerol 8% 8.00 jil 

Porcine genomic DNA 1-00 1*1 

25.00 

; *L_ 

The PCR primers used were as described below: 

KITTM-Nest-F (5'-CTC CTT ACT CAT GGT CGA ATC ACA-3') and 

5 

KITTM -Nest-R (5'-CGG CTA AAA TGC ATG GTA TGG-3 '). 

The TaqMan® probes used were: 

10 KITTM-A FAM (5'-TCA AAG GAA ACA TGA GTA CCC ACG CTC- 
3') and 

KITTM -G TET (5*- TCA AAG GAA ACG TGA GTA CCC ACG C -3') 

1 5 The TaqMan® probes were prepared by Perkin Elmer and labelled with FAM 
and TET as indicated as well as the standard quenching group TAMRA. The 
lOx TaqMan® Buffer A, Amplitaq Gold™, AmpErase N-Glycosylase, NTP's 
and 25mM MgCl 2 used were part of the TaqMan® PCR Core reagent Kit, 
supplied by Perkin-Elmer. 

20 

The reactions were then placed into a Perkin Elmer ABI Prism 7700 Sequence 
Detector and the reaction carried out using the following thermal profile, 50°C 
for 2 minutes, 95°C for 10 minutes followed by 40 cycles of 95°C 15s, 62°C 
60s. The reactions were carried out under the control of 'Sequence Detector 
25 V.1.6' software using the 'Single Reporter' and Real-Time' options with the 
'Spectral Compensation' function activated. Upon completion of the run real- 
time profiles for each sample were examined on the ABI7700 to check for any 
samples giving highly irregular profiles which were then excluded. The 
thresholds for both dyes, Fam and Tet, were set so that they intercepted each 
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dye during the exponential phase of PGR. Following updating of the 
calculations in 'Sequence Detector VU.6' software results were exportated into 
MS Excel for further analysis. 

iii. Analysis of results 

Based upon the underlying theoretical principle that one cycle of PCR gives a 
doubling in the amount of cleavage of the quenching dye from the allele 
specific probe and therefore doubles the signal one would expect the threshold 
cycle numbers from the II and Ii genotypes analysed to be as below: 



Table 6: 

Theoretical results for TaqMan® analysis of genotype at the KIT splice 
mutation 



Genotype 


Copies KIT 1 


Copies KIT 2 


Theoretical Ct 


Theoretical Ct 




(G) 


(A) 


TET(G) . 


FAM(A) 


n 


2 


2 


X 


Y 


Ii 


2 


1 


X 


Y+l 



In theory the Ct for TET and FAM signals, represented as X and Y should be 
the same, as equal numbers of copies of the target sequences should be present 
in an II animal. However in practice this does not necessarily occur due to 
differences in the hybridization and cleavage efficiency of the two probes and 
variation in the setting of the threshold cycle between the two dye signals. The 
reduction in splice mutant containing (A) sequences relative to those ngj 
containing the splice mutation (G) in the Ii animals ie 2:1 G:A ratio rather than 
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1:1 as for II genotype, should lead to the FAM signal reaching the threshold 1 
cycle later than the TET signal in the genotype Ii animals. The actual results for 
samples tested are shown in Table 7. 

Table 7 

Ct values from analysis of II and Ii genotypes 

Sample Genotype Ct FAM (A) CtTET Ct FAM- Ct TET 



(G) 


1 


Ii 


24.68 


22.59 


2.09 


2 


Ii 


25.98 


23.62 


2.36 


3 


Ii 


26.54 


25.57 


0.97 


4 


Ii 


27.37 


24.78 


2.59 


5 


Ii 


24.94 


21.61 


3.33 


6 


Ii 


25.68 


22.1 


3.58 
Ii Mean = 2.49 


7 


n 


22.05 


23.78 


-1.73 


8 


ii 


24.22 


24.59 


-0.37 


9 


ii 


24.19 


23.85 


0.34 


10 


u 


23.66 


23.51 


0.15 


11 


ii 


24.35 


22.71 


1.64 


12 


ii 


22.82 


21.69 


1.13 


13 


n 


22.84 


22.7 


0.14 


14 


ii 


23.17 


22.9 


0.27 
Mean = 0.20 
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No 35 35 

Template 

No 35 35 

Template 

No 35 35 

Template 

No 35 35 

Template 

Despite variation around the mean values it can be seen from Table 7 that there 
is a significantly increased delay in the FAM signal reaching the threshold level 
(approximately 2 cycles) relative to the TET signal in Ii animals compared to II 

5 animals as predicted, reflecting the reduced number of copies of the splice 
mutant (A) sequence present in animals of the Ii genotype. Plotting of the 
individual samples on a scatter plot (Figure 7) shows clustering of the two 
genotypes with the Ii cluster shifted along the Ct FAM axis due to the reduced 
number of copies of the KIT2 (A) sequence for which the FAM probe is 

10 specific. 
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CLAIMS: 

1 . A method for determining coat colour genotype in a pig which comprises: 
5 (a) obtaining a sample of pig nucleic acid; and 

(b) analysing the nucleic acid obtained in (a) to determine whether a mutation 
is/is not present at one or more exon/intron splice sites of the KIT gene. 

10 2. A method as claimed in claim 1 wherein the analysis in step (b) is carried 
out to determine whether a mutation is/is not present at the exon 1 7/intron 17 
boundary. 

3. A method as claimed in claim 2 wherein the mutation consists of the 
1 5 substitution of the G of the conserved GT pair for A. 

4. A method as claimed in any one of claims 1 to 3 wherein the sample of 
nucleic acid is amplified prior to analysis. 

20 5. A method as claimed in claim 4 wherein the nucleic acid is genomic 



7. A method as claimed in claim 6 wherein the pair of suitable primers is: 



DNA. 



6. A method as claimed in claim 5 wherein amplification is carried out 
using PCR and at least one pair of suitable primers. 



25 
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5'-GTA TTC ACA GAG ACT TGG CGG C-3'); and 
5*-AAA CCT GCA AGG AAA ATC CTT CAC GG-3\ 

8. A method as claimed in any one of claims 5 to 7 wherein after 
5 amplification the nucleic acid is treated with a restriction enzyme, followed by 

analysis of fragment lengths. 

9. A method as claimed in claim 8 wherein the nucleic acid is treated with 
the restriction enzyme Main. 

10 

10. A method as claimed in claim 8 or claim 9 wherein the ratio of 
restriction fragment lengths is determined. 

11. A method as claimed in claim 4 wherein the nucleic acid is mRNA. 

15 

12. A method as claimed in claim 1 1 wherein the nucleic acid is amplified 
using RT-PCR. 

13. A method as claimed in claim 12 wherein the length of RT-PCR product 
20 is determined. 



14. A method for determining coat colour genotype in a pig which comprises the 
step of analysing a sample of pig KIT protein to determine whether the protein is the 
splice variant protein. 
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15. A kit for use in determining the coat colour genotype of a pig which 
comprises one or more reagents suitable for determining whether a mutation is 
present at one or more exon/intron splice sites of the KIT gene. 

5 

16. A kit as claimed in claim 15 which comprises one or more reagents for 
carrying out PCR and one or more pairs of suitable primers. 

17. A kit as claimed in claim 16 which comprises the following pair of 
10 primers: 



5'-GTA TTC ACA GAG ACT TGG CGG C-3'); and 
5'-AAA CCT GCA AGG AAA ATC CTT CAC GG-3\ 
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Sequence Alignment Across the Exon /Intron Border of KIT Exon 17 

Allele Gene Copy Sequence 

Exon 17 Intron 17 

I KITl AAT TAC GTG GTC AAA GGA AAC I GTG AGT ACC CAC GCT CTC CTG ACA GTC 
KIT2 |A 

I P KITl |G 

KITl |G 

i KITl |G 
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Ratio of normal to splice mutant KIT 
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Ratios for Landrace and Large White 
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Plot of Ct FAM vs Ct TFT for TaoMan based PCR 
analysis of p orcine KIT splice mutation genotype 
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